Deep sequencing of HBV pre-S region reveals high heterogeneity of HBV genotypes and associations of word pattern frequencies with HCC
نویسندگان
چکیده
Hepatitis B virus (HBV) infection is a common problem in the world, especially in China. More than 60-80% of hepatocellular carcinoma (HCC) cases can be attributed to HBV infection in high HBV prevalent regions. Although traditional Sanger sequencing has been extensively used to investigate HBV sequences, NGS is becoming more commonly used. Further, it is unknown whether word pattern frequencies of HBV reads by Next Generation Sequencing (NGS) can be used to investigate HBV genotypes and predict HCC status. In this study, we used NGS to sequence the pre-S region of the HBV sequence of 94 HCC patients and 45 chronic HBV (CHB) infected individuals. Word pattern frequencies among the sequence data of all individuals were calculated and compared using the Manhattan distance. The individuals were grouped using principal coordinate analysis (PCoA) and hierarchical clustering. Word pattern frequencies were also used to build prediction models for HCC status using both K-nearest neighbors (KNN) and support vector machine (SVM). We showed the extremely high power of analyzing HBV sequences using word patterns. Our key findings include that the first principal coordinate of the PCoA analysis was highly associated with the fraction of genotype B (or C) sequences and the second principal coordinate was significantly associated with the probability of having HCC. Hierarchical clustering first groups the individuals according to their major genotypes followed by their HCC status. Using cross-validation, high area under the receiver operational characteristic curve (AUC) of around 0.88 for KNN and 0.92 for SVM were obtained. In the independent data set of 46 HCC patients and 31 CHB individuals, a good AUC score of 0.77 was obtained using SVM. It was further shown that 3000 reads for each individual can yield stable prediction results for SVM. Thus, another key finding is that word patterns can be used to predict HCC status with high accuracy. Therefore, our study shows clearly that word pattern frequencies of HBV sequences contain much information about the composition of different HBV genotypes and the HCC status of an individual.
منابع مشابه
Mutations in pre-core and basal-core promoter regions of hepatitis B virus in chronic HBV patients from Golestan, Iran
Objective(s): It has been reported that the mutation of the pre-core (PC) and basal-core promoter (BCP) may play an important role in the development of HBV-related hepatocellular carcinoma (HCC). In this study the PC and BCP mutations were investigated in chronic HBV patients. Materials and Methods:In this study, 120 chronic HBV patients from Golestan, Northeast of Iran who were not vaccinated...
متن کاملDifferent pre-S deletion patterns and their association with hepatitis B virus genotypes
AIM To investigate the associations of different types of pre-S deletions with hepatitis B virus (HBV) genotypes. METHODS The sequences of the pre-S region, basal core promoter (BCP) mutation, and precore (PC) mutation were examined through direct DNA sequencing or clonal analysis and sequencing in 273 HBV carriers, namely 55 asymptomatic carriers, 55 carriers with chronic hepatitis (CH), 55 ...
متن کاملPrevalence of Hepatitis B Virus, Genotypes, and Mutants in HBsAg-Positive Patients in Meerut, India
Background: Genetic changeability of hepatitis B virus (HBV) signifies a challenge for the sensitivity of immunologic and molecular diagnostics. Therefore, knowing the spread of HBV genotypes (GENs) and mutation has considerable impacts on treatment strategies, vaccination program, diagnosis, and prevention. The present study aimed to detect HBV GENs and mutants in HBsAg-positive patients. Meth...
متن کاملCombined mutations in pre-s/surface and core promoter/precore regions of hepatitis B virus increase the risk of hepatocellular carcinoma: a case-control study.
BACKGROUND We sought to investigate the role of sequence variations in pre-S/surface and basal core promoter (BCP)/precore regions of the hepatitis B virus (HBV) in hepatocellular carcinoma (HCC). METHODS The direct sequencing in pre-S/surface and BCP/precore regions of HBV was determined for 80 patients with HCC and 160 control patients with HBV infection. RESULTS Compared with control pat...
متن کاملAssociations of pri-miR-34b/c and pre-miR-196a2 Polymorphisms and Their Multiplicative Interactions with Hepatitis B Virus Mutations with Hepatocellular Carcinoma Risk
BACKGROUND Genetic polymorphisms of pri-miR-34b/c and pre-miR-196a2 have been reported to be associated with the susceptibility to cancers. However, the effect of these polymorphisms and their interactions with hepatitis B virus (HBV) mutations on the development of hepatocellular carcinoma (HCC) remains largely unknown. We hypothesized that these polymorphisms might interact with the HBV mutat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 14 شماره
صفحات -
تاریخ انتشار 2018